Long range Ising model for credit risk modeling in homogeneous portfolios 
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Within the framework of maximum entropy principle we show that the finite-size long-range 
Ising model is the adequate model for the description of homogeneous credit portfolios and the 
computation of credit risk when default correlations between the borrowers are included. The exact 
analysis of the model suggest that when the correlation increases a first-order-like transition may 
occur inducing a sudden risk increase. Such a feature is not reproduced by the standard models 
used in credit risk modeling. 



The modelization and control of risk is a subject of ex- 
treme importance in a rich variety of disciplines ranging 
from geology to medicine. To know which is the prob- 
ability distribution of rare events that may cause very 
important losses is the key point to be solved and to 
which a lot of efforts have been devoted for many years. 
Within the framework of finance, the most studied case 
is that of market risk (the risk associated to uncertain- 
ties of capital markets). But, by far, the risk that most 
concern practitioners (both due to its negative impact 
and its modeling difficulties) is credit risk |l|: a bank or 
financial institution give credits to N borrowers which 
may fail to return it. Such a group of borrowers consti- 
tutes a so called credit portfolio. It is very important to 
model which is the probability p(L) that L (0 < L < N) 
of such borrowers will fail in returning the credit. The 
knowledge ofp(L) is of extreme importance to decide the 
total amount of capital and reserves that the bank should 
hold to prevent a failure and for economic and regulatory 
reasons. Also p{L) is the main driver in the valuation and 
hedging of credit derivatives. 

Banks have been using many alternative models. Most 
of these, as a first step, consider the case of homogeneous 
portfolios for which the properties of the borrowers and 
their credits are supposed to be the same. In this paper 
we will also restrict to this homogeneous case. In the 
financial literature the description of a portfolio starts by 
defining the "losses" U (i = 1, . . . , N) which takes values 
and 1 depending on whether the borrower i returns the 
credit or fails to return it. For convenience, we define 
Si = 1 — 2li which takes values +1 when a borrower 
returns the credit and —1 when it fails to return it. Note 
that the number of losses is L = J2i h — (N — J^i Si)/2. 

The simplest model is the one of independent borrow- 
ers. This is characterized by a single parameter which is 
the probability of default p. In this case, the loss proba- 
bility is given by the binomial distribution: 



The expected value of the number of losses is (L) = 
N(k) = Np and its variance is V(L) = N 2 V(k) = 
N 2 p(l — p). (The corresponding expressions in terms of 
the Si variables can be easily derived taking into account 
that (Si) = N(l-2p) and V(Si) = Ap(l~p).) The main 
drawback of this model is that it does not contain the 
possibility for correlations between borrowers which are 
known to exist and play an important role. Although in 
some cases the links between borrowers can be difficult 
to understand, in many cases borrowers tend to default 
in a cooperative manner. 

The definition of the default correlation p in terms of 
the Si variables is given by: 



(S i S j )-(S i )(S j 



V(s'i)-(s t )^(s 2 )-(s 3 y 



(i^j)- (2) 



The same formula applies for the U variables, as can be 
checked straightforwardly. To include correlations differ- 
ent theoretical models have been proposed by modify- 
ing the model of independent borrowers including phe- 
nomenological hypothesis on how p depends on external 
factors |2|. For instance, the economic conditions can be 
described by a latent variable y which is a random vari- 
able with a certain probability density f{y). Then, the 
concept of default probability p is substituted by the con- 
cept of conditional default probability <j>(y) such that for 
a fixed value of y the probability of L defaults is binomial 
with p = (f>(y). Therefore, 



P(L) 



dyf(y) 



[l-<j>(y)] N ~ L <b(y)\ (3) 



Such conditionally independent models (CIM) are typ- 
ically characterized by two parameters which could be 
empirically determined. First, the average (also called 
unconditional) default probability given by: 



Pb(L) 



(1-P) 



N-L p L_ 



(1) 



P = (h) 



dy<t>(y)f(y)- 



(4) 
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Second, the default correlation which, for the case of ho- 
mogeneous portfolios, is given by: 



fZottvFMdv- fZ<t>(y)f(y)dy 



S-oo <t>(y)f(y) d y - SZo <Ky)/(y) d 2/ 



(5) 



For instance, the Merton based model (used by JPMor- 
gan and KMV companies in their credit portfolio models) 
corresponds to the assumption that f(y) is the N(0, 1) 
Gaussian probability density and the function <f> is: 



My) = * 



- \/qy\ _ 



l 



/2tt 



e~ z2/2 dz, (6) 



where $ is the error function, c is the "critical value" and 
q is the "asset correlation" (which should not be confused 
with the default correlation). For this case it is easy to 
check that the average default probability is p = <E"(c), 
and that the default correlation is: 



P = 



du dv 



^-(u 2 +v 2 -2quv)/2(l-q 2 ) 



(7) 



Other examples of CIM are the CreditPortfolioView 
model (of McKinsey) and the CreditRisk+ model (of 
Credit Suisse Financial Products) which use different 
functions <fi(y) and f(y) 0,0]. The CIM have been shown 
to only capture cyclic correlations, leavingaside direct 
correlations, contagion and cascade effects pf- 

In this letter we propose a different method to model 
a credit portfolio that includes default correlations in a 
fundamentally different way. We describe portfolios with 
vectors X = (Si, S2, ■ ■ • , Sn). The set of all possible 2 N 
vectors X is, therefore, our phase space Q = {X}. To 
solve a credit portfolio model is to assign a probability 
law P{X) on il. Instead of trying to derive which are the 
dependencies between the different borrowers or between 
the borrowers and the external factors, we will assume 
that default correlation p is different from zero and derive 
P(X) from the Maximum Statistical Entropy principle. 
This principle, which is essential in Information Theory, 
has been widely used for building probability distribu- 
tions when a underlying theory is lacking. It states that 
one should consider the model P(X) that maximizes the 
entropy functional: 



S[P] = Y,P(X)lnP(X), 
xen 



(8) 



subject to the conditions imposed by the previous known 
information. Such a method allows to control what are 
the exact ingredients (or the exact information) that is 
taken into account to formulate the model. This is op- 
posite to the above presented theoretical models which 
assume a rigid structure for relations between borrowers 
that cannot be empirically checked. 



It is instructive to first present the derivation of the 
simplest model. We will impose only two conditions. 
First the normalization condition: 



1. 



(9) 



xen 



Second we assume that we have a previous knowledge of 
the expected value of the number of losses (L) or, equiv- 
alently, of the average default probability p. In terms of 
the probability law P(X) this condition is written as: 



<]T S t ) = £ P(X)M(X) = N(l 2p), 



(10) 



xen 



where we have defined the function M(X) — Si. We 
want to stress that the knowledge of the average default 
probability p allows to fix the expected value of M but 
not its exact value. Maximize (JSJ) with conditions J§J| and 
(|10fl is equivalent to maximize the Lagrange functional: 

MP] = £ P ( X )m P ( X )- A (E P W-l) 

xen \xen ) 

- a[Y J M(X)P(X)-N(l-2p)Y (11) 
\xen J 

where A and a are Lagrange multipliers. The result of 
the maximization is P(X) = e x - 1 + aM i x ) , xhe param- 
eter A can be easily deduced imposing ©. To obtain a 
we require (M(X)) = N(l — 2p). A straightforward alge- 
bra renders a = tanh _1 (l — 2p). The probability P(X) 
results in: 



N + M(X) N-MiX) 

P{X) = {l-p)^ LA p-^ LA . 



(12) 



Note that this probability depends on X only through 
the function M(X). Since there are (^) different vectors 
that give the same loss L, the probability p{L) turns out 
to be Eq. substituting p by p. 

The inclusion of the existence of correlations between 
borrowers into the model requires to introduce a third 
condition in the maximization of (JSJ) . In this case, the 
symmetric function of X whose expected value is fixed 
by the default correlation p is: 



H{X) = 



N-l^ SiSj - 

i>j 



(13) 



Note that the number of terms in the sum is N(N — 1) /2 
so that the pre-factor 2/(N — 1) ensures that H(X) cx N . 
Using ©, and the fact that (Sf) = 1 it is straigfor- 
ward to check that the knowledge of p allows to fix the 
average value of (H(X)) according to: 



(H(X))=N[l-4p(l-p)(l-p)]. 



(14) 
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Imposing this condition, together with conditions © and 
(|10|l . leads to the following Lagrange functional: 



Hf[P] = * [P]-i3 



xen 



(15) 



Diferentiating and imposing © to eliminate A one ob- 
tains: 



P(X) 



yM(X)+/3H(X) 



xen 



e aM(X)+f3H(X) ' 



(16) 



This can be be rewriten as P(X) = e /£ 



with: 



xen ' 



-H 



N 



(17) 



i=i 



which is the canonical probability distribution corre- 
sponding to the well known long-range Ising model 
(LRIM) of a finite system with N spins and a conve- 
nient redefinition of the exchange constant and the ex- 
ternal field jjj. Note that P(X) in (|16|l . again, is only 
a function of M{X) since H(X) = ^ (M(X) 2 - N). 
Therefore the probability of L losses will be obtained by 
multiplying expression l|16|) by the binomial number f , ) ■ 

To determine the values of a and (3 in 116fl that allow to 
impose conditions (|10|l and Ijl4|l for given values of p and 
p is a difficult task that has been extensively studied in 
the field of Statistical Mechanics. Note that the problem 
when N — > oo can be easily solved by a saddle point 
method. This method renders a solution that coincides 
with the solution obtained by the mean field hypothesis 
0. But, such an approximation is of little interest here 
because of two reasons: credit portfolios, although being 
large, are quite far from the thermodynamic limit and 
such a mean field approximation inmediately leads to p — 
(except for the unrealistic case of p — 0.5). 

Fig-fflshows an example of the probability p(L) derived 
by the LRIM corresponding to p — 0.02 and p = 0.01 
(a = 8.85 10~ 3 , = 1.01148). The behaviour is com- 
pared with the binomial model (without correlation) and 
the Merton based model. In this last case the parame- 
ters (c = —2.054 and q = 0.0718) have been calibrated 
in order to have the same values of p and p = 0.01. Note 
that the LRIM is the only model able to capture the exis- 
tence of a second peak in the large L region (negative M 
region) that appears due to the existence of correlations 
which favour a collective behaviour of the borrowers. 

Although the peak in the right (large L region) is usu- 
ally very small compared to the peak in the left (small 
L region) it has strong implications in the evaluation of 
credit risk. There are different strategies to measure risk. 
A common one is the computation of the so called Value- 
at-Risk (VaR): this is the number of failures A that must 
be considered in order to ensure that the probability of 



having more than A fails is smaller than a certain tiny 
confidence level a c . Thus A is given by the solution of 
the equation: 



N 

E 

L=A 



P(L) 



(18) 




FIG. 1: Comparison of the probability of L failures p(L) (in 
linear-log scale) obtained from the LRIM (■), the binomial 
model (O) an d the Merton based model (A) with N = 500 
and p = 0.02. Both the LRIM and the Merton model corre- 
spond to p = 0.01. The inset shows the peak on the left in 
linear scale. The scales on top show the corresponding values 
of M. 



The smaller the confidence level, the larger A. Fig. [21 
shows an example of the behaviour of the sum in the left 
of (|18fl as a function of a variable A. The horizontal line 
indicates the confidence level. Note that given the two- 
peaked shape oip(L), the sum in (|18l) exhibitis a constant 
plateau. Consequently a small change in p may produce a 
drastic change in the position of the VaR (crossing point). 

0,000105 r 



o-o 0.0098 
" 0.0099 
m 0.0100 



' ooooooooooo 



FIG. 2: Calculation of the VaR for a given confidence level 
Qf c = 0.0001 (dashed line). The continuous lines with symbols 
show the behaviour of the left hand side in equation 1181 as a 
function of A for the LRIM with p = 0.01 and different values 
of p, as indicated by the legend. 
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From the physical point of view, this change is very sim- 
ilar to a first-order phase transition in which the order 
parameter (VaR) exhibits a discontinuity. The main dif- 
ference is that our system is small and therefore real dis- 
continuities do not exhist but, instead, there are sharp 
changes. In order to make clear this sharp behaviour 
we show in Figure the VaR as a function of p for 
p = 0.01 and a c = 0.0001. The Merton model displays an 
smooth increase of the VAR with increasing correlation. 
For small correlations the differences between the LRIM 
and the binomial one are negligible. However, there ex- 
ist a value of the correlations for which, according to the 
LRIM, a dramatic increase in the VaR occurs, which is 
not reproduced by the Merton model. 




default correlation p 



FIG. 3: VaR as a function of the default correlation p for 
the LRIM, the Merton model and the binomial model (inde- 
pendent of p). The three models correspond to N = 500 and 
p — 0.01. The inset shows the position of the sharp increases 
of the VaR in the p-p space predicted by the LRIM, for dif- 
ferent values of the confidence level a c , as indicated by the 
legend. 

It is of paramount importance to locate the region where 
such strong change in the VaR occurs for different values 
of the confidence level a c . This is shown in the inset of 
Fig. The linear axes represent the space of empiri- 
cal parameters p and p. The different lines, correspond- 
ing to different confidence levels, separate the region in 
which the behaviour of the LRIM is similar to the bi- 
nomial model (below) from the region where collective 
failure may occur and thus predictions with the bino- 
mial model strongly uderstimate the VaR. Other mea- 
surements of risk (like the expected shortfall) may exhibit 
even stronger diferences. 

There are a series of possible extensions of the 
LRIM model: first, the extension to the case of non- 
homogeneous portfolios is straightforward. The borrow- 
ers need to be separated in different tranches. For each 
tranche the formulation of the model will require the 
knowledge of the default probability, the autocorrelation 
within each tranche and the correlations among different 



tranches. Note that in the LRIM model will be easy to 
introduce negative correlations between certain tranches 
to represent big competitors in small markets. This is im- 
possible to reproduce with the conditional independent 
models. A second interesting extension will be to change 
the discrete variables Si to continuous variables fa. The 
Ising model, then, be substituted by a 4> 4 model. This 
will enable to describe the fact that when a borrower 
defaults there is always a recovery of part of the loan. 
A third extension could be the inclusion of randomness 
in the LRIM. For instance random fields and/or random 
bonds would allow to take into account the uncertainties 
in the knowledge of the empirical values of p and p. Fi- 
nally one could include higher order interaction terms, 
like three spin terms. This would allow to introduce, 
in a controled way, the existence of three borrower cor- 
relations which are know to exist when loans are fully 
guaranteed by third parties. Note that the conditional 
independent models do also introduce higher order cor- 
relations, but in a completely uncontrolled manner. 

Within the framework of information theory, it has 
been shown that the LRIM is the natural model to be 
used for the description of credit portfolios and risk mod- 
elling when knowledge about the existence of correlations 
is taken into account. One of the main results is that risk 
measurements can display big and sharp increases due to 
default correlations, in regions of the space parameters 
where other models fail to predict them. The discussion 
presented also shows that the LRIM can be useful not 
only in finance but, in general, for any complex system 
described by binary variables for which the available in- 
formation is the average value and the correlations. 
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